Optimising financial computation for reconfigurable hardware

نویسنده

  • Qiwei Jin
چکیده

This thesis proposes novel methodologies for design, optimisation and generalisation of reconfigurable hardware based finance computation. The applications of the proposed methodologies to numerical methods which are commonly used in the finance industry, such as Monte Carlo and Finite difference are studied in detail. These studies show reconfigurable hardware can effectively improve performance and energy efficiency in finance computation. There are three contributions. First, an application independent Monte Carlo framework for interest rate derivatives payoff evaluations based on the HeathJarrowMorton (HJM) mathematical Framework. By identifying three levels of functional specialisations in the model, the framework is able to retain good performance while supporting multiple applications. In addition, a process is proposed for the Monte Carlo framework to identify the optimal reduced precision data representation, in order to utilise hardware resource better and retain output numerical accuracy. The automatically generated Field-Programmable Gate Array (FPGA) implementations show significant speedup and energy saving over comparable Central Processing Unit (CPU) and Graphical Processing Unit (GPU). Second, a novel framework for accelerating option payoff evaluation based on finite difference method. The parallelism of the proposed architectures is exploited based on two levels of computational granularities. The implementations are generated based on a high level description. Significant speedup and energy savings are archived comparing our FPGA designs over both CPU and GPU designs. Third, a novel performance optimisation process based on dynamic reconfiguration for stencil computation. By optimally adjusting the underlying numerical procedure and making use of carefully chosen coefficients for constant multipliers, both the hardware resource consumption per kernel and the amount of computation needed per problem are reduced, and the numerical accuracy requirements are also met. Significant speedup is shown by comparing the optimised dynamic design with the unoptimised dynamic design and the original static design.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Real-Time, Dynamic Hardware Accelerators for BLAS Computation

This paper presents an approach to increasing the capability of scientific computing through the use of real-time, partially reconfigurable hardware accelerators that implement basic linear algebra subprograms (BLAS). The use of reconfigurable hardware accelerators for computing linear algebra functions has the potential to increase floating point computation while at the same time providing an...

متن کامل

Architectural Modeling for Hardware and Software in Reconfigurable Embedded System

In Recent high-performance embedded system technology, applications are demanding very high performance from system with minimum available hardware resource. Due to this fast development, developer integrates more functionality into a single chip (Hardware). Reconfigurable architectures can adapt the behavior of the hardware resources to a specific computation that needs to be performed. Comput...

متن کامل

Improved Configuration Prefetch for Single Context Reconfigurable Coprocessors

Current reconfigurable systems suffer from a significant overhead due to the time it takes to reconfigure their hardware. In order to deal with this overhead, and increase the compute power of reconfigurable systems, it is important to develop hardware and software systems to reduce or eliminate this delay. In this paper we propose one technique for significantly reducing the reconfiguration la...

متن کامل

A Self-optimising Simulator for a Coarse-grained Reconfigurable Array

This paper describes the implementation and testing of a highspeed simulator for a reconfigurable processor architecture named MCGREP. The architecture is based on a coarse-grained array of small processors controlled by reconfigurable microcode. A high-speed simulator is needed to allow complex experiments to be carried out on MCGREP, involving large applications and time-consuming computation...

متن کامل

The Cameron Project: High-Level Programming of Image Processing Applications on Reconfigurable Computing Machines

Reconfigurable computing maps computation onto flexible and reprogrammable hardware. A typical reconfigurable computing (RC) system consists of a host processor (with a traditional architecture) and one or more reconfigurable coprocessors. Proposed hardware architectures for reconfigurable co-processors fall in two broad categories [4]: netlist computers with uniform arrays of fine grained logi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013